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A COMPUTUER  MANAGEMENT  SYSTEM  FOR  APPLE  (”MALUS  X 
DOME ST I CA"  BORKH.)  GERMPLASM  WITH  RESISTANCE 
TO  DISEASES  AND  ARTHROPOD  PESTS 

H.F.  Goonewardene,  V.  Rudkevich,  R.  Grosso,  and  E.B.  Williams1 
INTRODUCTION 

An  important  contribution  to  apple  production  could  be  made 
through  the  development  of  germplasm  with  the  genetically 
inherited  traits  for  resistance  to  one  or  several 
economically  important  pests.  This  approach,  if  successful, 
would  minimize  the  dependence  on  the  multiple  applications  of 
several  pesticides  for  producing  cosmetically  acceptable 
apples,  the  impact  on  pollution  of  the  environment,  and  the 
dependence  on  fossil  fuels.  A large  body  of  data  will 
necessarily  be  acquired  during  such  a task,  and  rapid  access 
to  the  data  base  is  essential  for  its  exploitation.  The 
feasibility  of  using  contemporary  computer  technology  in 
handling  the  necessary  information  has  been  reported  by 
Ghaderi  et  al.  (1984).  The  only  available  report  on  computer 
usage  in  tree  fruits  for  other  than  developing  an  inventory 
is  that  of  Bell  et  al.  (1980),  who  developed  a COBOL  program 
for  tracing  pedigrees  from  a data  base  developed  for 
pears,  Pyrus  sp.,  and  those  of  Van  der  Zwet  et  al.  (1979)  and 
Blake  and  Van  der  Zwet  (1979)  who  developed  a data  base  for 
use  in  the  U.S.  Department  of  Agriculture's  pear  breeding 
program. 

We  chose  Pascal  from  among  the  well  defined,  high-level 
languages,  for  our  work  on  coding  and  programming.  Our 
programs  were  developed  for  use  with  the  main-frame  computers 
(CDC6500  and  6600)  at  Purdue  University,  West  Lafayette,  IN. 

The  data  base,  files  and  programs  are  illustrated  in  figure  1 
and  described  as  follows: 


^-Goonewardene  is  a research  entomologist 
with  the  U.S.  Department  of  Agriculture, 
Agricultural  Research  Service;  Rudkevich  and 
Grosso  are  student  programmers,  and 
Williams  is  Professor  (Emeritus),  Depart- 
ment of  Botany  and  Plant  Pathology,  all  at 
Purdue  University,  West  Lafayette,  IN  47907. 
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Figure  1. 

Flow  chart  of  the  general  structure  of  the 
data  base  and  program  sequences  for  its 
utility  in  host  resistance. 

* 

Are  user  references  that  store  the 
complete  names  of  all  apple  selections 
in  this  data  base. 
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The  data  base  is  composed  of  information  recorded  over  a 
period  of  30+  years  and  contained  in  the  following  files: 


XPEDAB 

APTRACE 

HORT 

INTEST 

ERTEST 

The  contents  of  each  file  is  described  below: 

1.  File  name:  XPEDAB  - an  unsorted  master  file  of  the 
complete  names  of  all  entries  and  their  corresponding 
abbreviated  names.  Each  abbreviated  name  has  a limit  of  12 
characters.  Blanks  are  provided  if  the  abbreviated  names 
have  less  than  the  limit  of  12  characters.  The  complete  name 
is  limited  to  121  characters.  Providing  blanks  to  reach  this 
limit  is  not  necessary.  An  example  of  the  entries  in  the  file 
XPEDAB  is  as  follows: 

Column  1 13  16 


Abbreviated  name 


Complete  name 


ANTMON 


ANT0N0VKA  MONASIR 
HENRIETTA  CROSBY 


Note-  An  unduplicated,  unique  name  and  abbreviation  is  used 
for  each  entry. 

2.  File  name:  APTRACE  - an  unsorted  master  file  of 
progeny/parent  combinations  that  is  the  source  of  several 
other  oper  itions.  The  name  of  a progeny  or  a parent  is 
limited  to  12  characters.  Blanks  are  provided  for  names  that 
have  less  than  12  characters  to  reach  this  limit. 

An  example  of  the  entries  in  the  file  APTRACE  is  as  follows: 


Column  1 


13 


25 


Progeny 


Female  parent 


Male  parent 


HER3T17  6 
PWR18T28 


GDEL 

JON 


COOP  17 
OR53T52 


3.  File  name:  HORT  - an  information  file  on  the  origin  of 
each  germplasm  entry,  along  with  the  associated  horticultural 
characteristics  and  its  response  to  disease. 
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2 

A.  Gerraplasra  entries:  Each  entry  is  identified  in  columns 
1-10  by  (1)  name,  for  example,  Jonathan,  (2)  numeric  code, 
for  example  2907,  or  (3)  alphanumeric  identification,  for 
example  HAR21T200,  depending  on  whether  the  individual  entry 
was  a cultivar,  progeny,  or  selection.  Germplasm  obtained  as 
a plant  introduction  is  identified  by  the  the  prefix  PI 
followed  by  a letter  plus  5 or  6 numbers,  for  example 
PIB57270^  or  PI104727.  Germplasm  entries  from  the  Cooperative 
apple  breeding  program  among  Purdue  (P)  University,  W. 
Lafayette,  IN,  Rutgers  (R)  University,  New  Brunswick,  NJ,  and 
the  University  of  Illinois  (I)  Champaign-Urbana,  IL,  (PRI) 
are  identified  differently,  depending  on  whether  they  were 
progenies  or  selections.  If  they  were  progenies,  the 
identification  was  by  numbers  with  2,  3,  or  4 digits  denoting 
the  progeny  or  number  of  the  cross,  such  as  49,  or  703  or 
1789,  followed  by  a hyphen  and  a number  to  identify  the 
seedling,  such  as  1789-1.  Progeny  selections  were  identified 
after  vegetative  propagation  by  the  farm  and  block  name 
composed  of  one  or  two  letters,  1-3  digit  row  numbers,  and  1- 
3 digit  tree  number  such  as  CBR1T29. 

Farm  and  Block  listings  were  as  follows: 

C = Clark  Farm  (Blocks  A,  B,  & C;  DG  = Douglas;  H = Hinsley; 

0 = O'Neall  Farm  (original  seedlings);  OB  = O'Neall  Farm 
(Block  B);  P = Poultry  farm;  St  = Storage  block;  T = 
Throckmorton  (Block  N = North;  Block  S = South)  and  9A  = 

Block  9A  at  O'Neall  Farm. 

The  entry  CBR1T29  refers  to  CB  = Clark  Farm,  Block  B,  R1  = 
row  1 and  T29  = tree  29. 

Germplasm  originating  in  New  Jersey  was  identified  by  the 
prefix  NJ  or  DIR  followed  by  a maximum  of  5 digits  or 
characters,  for  example,  NJ44,  D1R8T352,  and  those  from  New 
York  had  the  prefix  NY  followed  by  5 digits  with  a separating 
hyphen  followed  by  more  digits  to  indicate  the  tree  numbers, 
for  example  NY55166-23. 


2The  term  "germplasm  entry"  is  used  in 
this  paper  to  indicate  a progeny,  or  a 
selection  or  a cultivar. 

2B57270  and  similar  entries  are  B numbers 
assigned  to  accessions  during  the  quaran- 
tine process  before  virus  indexing, 
certification,  and  release.  Since  these 
accessions  are  at  the  Plant  Inventory 
Stations,  we  have  added  the  prefix  PI 
to  the  B numbers  for  this  publication. 
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B.  Horticultural  and  disease  incidence  data  were  coded  as 


shown  as 

Column/ s 

31-32 

33-34 

35-36 

37 

38 

39 

40 

41 

42 

43 

44 
45-46 


shown  below: 


Description 

month  of  fruiting 
day  of  fruiting 
year  of  fruiting 

size  of  fruit  in  inches;  coded  as  1=2  1/4, 
2=2  1/2,  3=2  3/4,  4=3. 
quality;  coded  as  l=very  good,  2=good, 
3=fair,  4=poor. 

shape  of  fruit;  coded  as  l=oblate, 

2=oblong,  3=conical,  4=round. 
color  of  fruit;  coded  as  l=light  red, 

2 = medium 

red,  3=dark  red,  4=yellow. 

flesh  type;  coded  as  l=hard,  2=tough, 

3=crisp,  4=tender. 

season;  coded  as:  l=late  July,  2=early 
August,  3=late  August,  4=early  September, 
5=late  September,  6=0ctober. 
mildew  rating;  field  rating  of  mildew 
coded  as  l=highly  resistant,  2=resistant, 
3=moderately  resistant,  4=suscept ible , 
5=highly  susceptible, 
spur  type;  coded  as  l=yes,  2=no. 
scab  resistance ; coded  for  27  different 
sources  used  for  introducing 
resistance  to  scab  (single  or  multiple 
genes) . 

01  = Antonovka,  02  = Antonovka  Monassir, 

03  = atrosanguinea  pit,  04  = 
atrosanguinea  3 type,  05  = baccata 
jackii,  06  = floribunda, 

07  = English,  08  = Hbl,  09  = Hb2,  10  = 
Jonsib,  11  = MA4,  12  = MA8,  13  = MA16, 

14  = MA1255,  15  = micromalus  (pit), 

16  = micromalus  3 type,  17  = prunifolia, 

18  = prunifolia  microcarpa,  19  = prunifolia 
xanthocarpa,  20  = Russian  (differential), 

21  = Russian  (non-differential), 

22  = Sargenti,  23  = sieboldii, 

24  = zurai  calocarpa, 

25  = dwarf  spreading  toringo, 

26  = other;  27  = scab  susceptible. 
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A sample  of 

some  entries 

of  the  file 

HORT  follows: 

Column 

t— H 

i — 1 

«— H 

«— l 

r— I 

C\1 

CO 

Germplasm 

o 

Coded 

Scab 

entry 

¥ 

information 

resis- 

tance 

PAR4T236 

ROCK24-224 

ROCK23-57 

101378333226 

04 

PAR4T8 

ROCK25-16 

NJ159155 

080378334231 

06 

PAR5T134 

RALLS 

HAR5T8 

1020  321136 

06 

PAR5T170 

RALLS 

HAR5T8 

100578334236 

06 

PAR5T176 

RALLS 

TNR10T11 

0928  3231  5 

06 

PAR5T202 

DELICIOUSM 

PWR4T5 

091878333224 

15 

PAR5T203 

DELICIOUSM 

PWR4T5 

091878421134 

15 

PAR8T186 

SUMMERRED 

PWR14T40 

081977324133 

06 

Note  - The  priorities  in  assignment  of  information  by  the 
Pascal  Compiler  varies  depending  on  the  content  of  the  space. 
In  some,  a blank  space  is  assigned  a lower  priority  than  a 
space  containing  a number  or  a character.  Because  of  this 
peculiarity  in  compiling,  we  see  in  the  previous  example  that 
PAR4T8  is  listed  after  PAR4T236. 

4.  File  Name:INTEST-  information  file  on  the  response  of 
the  germplasm  screened  for  resistance  to  codling  moth, 
Laspyresia  pomonella  L.  (CM),  plum  curculio,  Conotraohelus 
nenuphar  (Herbst)  (PC),  apple  maggot,  Rhagoletis  pomonella 
(Walsh)  (AM),  redbanded  leafroller,  Argyrotaenia  velutinana 
(Walker)  (RB),  according  to  laboratory  procedures  described 
by  Goonewardene  et  al.  (1975,1979)  is  stored  in  this  file. 

The  amount  of  damage  to  the  fruit  for  each  entry  caused  by 
each  of  these  insects  and  the  control  (cultivar  'Jonathan') 
is  recorded.  The  significance  of  the  differences  in  damage 
between  the  control  and  the  germplasm  entry  is  noted  as  the 
significance  rating  (SI  = significant  at  P<0.05,  S2  = 
significant  at  P<0.01,  and  NS  = nonsignificant,  that  is 
P>0 . 05 ) . 


An  example  of  the  data  in  the  file  INTEST 
follows : 


Column 


LncncMcocncoiooco  r^-  o h-  cm 

■^t-cmcmcmcocotI'M'  in  lq  in  to 


w to 


E 

0 

0 

CO 

d 

if) 

o 

o 

03 

z 

03 

if) 

to 

_C 

03 

_C 

CO 

CL 

E 

ofi 

*55 

0 

C 

"55 

X 

0 

03 

O 

if) 

X 

0 

C 

V) 

X 

0 

03 

O 

CO 

o 

C 

0 

ZE 

d> 

ZE 

d> 

o 

d) 

o 

03 

5 

03 

CD 

03 

O 

<D 

> 

o 

o 

o 

</> 

D_ 

CL 

<7) 

< 

CO 

< 

CO 

cc 

co 

CBR1T29  791  07.0NS  03.0NS 

CBR2T43  761  00.8NS  00.1  NS 

CBR4T20  781  00.9NS  00.9NS  00.5NS  12.9NS  02.0NS  00.3S1  01.0NS 
CBR4T20  791  00.0S1  00.0S1 

CBR4T44  751  02.6NS  03.1  NS  01.6NS  02.9NS  03.8NS  02.6NS  00.3NS 
CCR03T1 1 761  02.3NS  16.5NS  01.6NS  00.5NS01.9NS 


Note-  CBR4T20  was  tested  for  CM  in  1978  and  retested  in 
1979. 

5.  File  name:  ERTEST  - stores  information  on  the  mean 
numbers  of  European  red  mites  found  for  an  entry  compared 
with  those  numbers  found  on  HCR21T200  (control)  in  resistance 
studies  conducted  according  to  the  procedures  described  by 
Goonewardene  et  al.  (1976).  Differences  between  means  between 
a germplasra  entry  and  the  control,  HCR21T200,  are  reported  as 
SI  = significant  at  P<0.05,  S2  = significant  at  P<0.01,  and 
NS  = nonsignificant  that  is  P>0.05. 

The  example  of  the  entries  in  the  file  ERTEST  is  given  below: 


Column  rH 

t — i 

CO 

CM 

00 

CM 

Gerraplasm 

Year 

Mite 

Significance 

entry 

of  test 

count 

rating 

HCR21T200 

80B 

43.3 

NS 

HCR23T125 

74-75 

04.9 

SI 

HCR7T163 

74-75 

03.7 

SI 

HCR9T48 

80A 

01.1 

SI 

NJ123056 

79 

20.0 

NS 

NJ123056 

80B 

20.0 

SI 

NJ154953 

79 

37.5 

NS 

NJ154955 

80B 

37.5 

NS 

NJ159155 

79 

22.8 

NS 

NJ139155 

80B 

22,8 

SI 

NJ25 

74-75 

07.8 

NS 

NJ43 

79 

33.6 

NS 

NJ43 

80B 

33.6 

NS 
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6.  Program:  SRTABRV  - sorts  the  randomly  listed, 
alphanumeric,  abbreviated  entries  from  the  input  file  XPEDAB. 
The  program  outputs  a listing  of  the  abbreviated  names  of 
entries  and  their  complete  names  listed  alphanumerically. 

7.  File  Name:  STABRVA  - a file  containing  the  output  from 
program  SRTABRV  is  organized  as  follows: 

The  output  below  is  the  example  of  entries  in  file  STABRVA 


Column  ^ 


Germplasm  entry^ 

Names  of  parents 

ANNA 

— 

ANNA 

BMDEL 

= 

BUD  MUTATION  OF  DELICIOUS 

EDEL 

= 

EARLY  DELICIOUS  //*SYN.  WITH 

EARLIRED 

DELICIOUS*// 

MACOUN  NJ28 

= 

(MACOUN  * NJ28) 

8.  Program:  APSORT  - sorts  the  randomly  listed  alphanumeric 
progeny/parent  combinations  from  the  input  file  APTRACE 
through  a binary  insertion  sorting  routine.  The  output  is 
listed  in  the  file  MSTRSRT. 

9.  File  name:  MSTRSRT  -file  has  the  output  from  the  program 
APSORT  that  sorts  progeny/parent  combinations  and  lists  them 
by  progeny  name. 

The  file  follows  the  same  format  as  was  used  for  APTRACE.  The 
data  from  this  file  is  used  as  the  input  for  the  program 

FORMS. 

10.  Program:  FORMS  - groups  all  progenies  from  the  input  file 
MSTRSRT,  that  have  identical  male  and  female  parents.  The 
output  is  generalized  in  two  formats:  (1)  a user  readable 
reference  document  and  (2)  a program  readable  file  (See  file 
description  SYNFILE). 

This  program  initiates  a single  trace  of  a pedigree  for  all 
progenies  having  the  same  pedigree.  Traces  for  progenies  with 
the  same  pedigree  will  not  be  generated  but  noted  after  the 
respective  pedigree  trace  is  completed. 


^In  the  examples  cited,  only  the  abbrev- 
iated names  of  germplasm  entries  are  used; 
for  example  (MACOUN  *NJ28). 


The  program  reads  the  file  for  the  name  of  germplasm  as 
follows : 


34.000= 

35.000= 

PROCEDURE  READ1 2(VAR  S : NAMES);  (*this*) 

36.000= 

(^procedure*) 

37.000= 

VAR 

(*will  get*) 

38.000= 

A :NAMES; 

(*and  read  12 

39.000  = 

I : INTEGER; 

^characters* 

40.000= 

CH  : CHAR; 

(*from  the*) 

41.000= 

(*input  synfi' 

42.000= 

BEGIN 

43.000  = 

FOR  I : = 1 TO 

12  DO 

44.000= 

A[I]  : = ; 

(*A  designates  the*) 

45.000= 

I : = 0; 

(*germplasm  entry*) 

46.000= 

WHILE  (NOT  EOLN 

(SYNFILE))  AND  (I  < 12)  DO 

47.000= 

BEGIN 

48.000= 

I : = I + 1; 

49.000= 

READ( SYNFILE 

CH); 

50.000= 

A [ I ] : = CH; 

51.000= 

END; 

52.000= 

S : = A; 

53.000= 

END; 

54.000= 

55.000= 

Note  that  through  the  use  of  Program  FORMS  one  creates  a file 
of  synonyms  (SYNFILE). 

11.  File  Name:  SYNFILE  - lists  all  entries  having  identical 
pedigrees  and  listing  them  starting  in  column  39  as  shown  in 
the  following  example: 


Column  <- 

CO 

to 

C\J 

O) 

CO 

Germplasm 

Female 

Male 

Names  by  which  known  parents  have 

entry 

parent 

parent 

been  designated  at  difterent 

stages  of  previous  breeding 

programs 

HER3T50 

PWR22T12 

PRIMA 

HER3T65 

HER3T66 

HER3T79 

HER3T80 

HER4T112 

PRIMA 

PWR22T12 

HER4T56 

HER4T177 

HAR19T194 

HCR23T94 

HER4T182 

HER5T106 

HCR6T149 

HCR5T78 

HER7T142 

TSR13T130 

HAR30T106 

12.  Program:  SORT:  Sorts  randomly  listed  alphanumeric 
entries  in  each  of  the  files,  HORT,  INTEST,  and 
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ERTEST  and  arranges  the  entries  in  alphabetical  order.  We 
used  the  bubblesort  procedure  which  sorts  N entries  at  a 
speed  proportional  to  the  square  of  N (Nz),  but  if  a more 
rapid  sort  is  desired  or  if  there  is  a need  to  sort  a very 
large  number  of  entries,  another  sorting  procedure  should  be 
used  such  as  QUICKSORT  (O(NlogN))  as  described  by  Wirth 
(1976).  Sample  output  of  the  program  SORT  were  shown  earlier 
in  files  HORT,  INTEST,  and  ERTEST. 

13.  Program:  MERGE  - the  Pascal  code  for  merging  the  data 
from  the  files  HORT,  INTEST,  and  ERTEST  for  each  entry 
entry  and  storing  it  on  1 line  of  the  array'*  MERGAR 
is  shown. 


100.000=PROCEDURE  MERGA(VAR  LINE  : INTEGER); 

101.000=VAR  (*APPLE-holds*) 


102.000= 

I , J ,K.  : INTEGER; 

(*hort*) 

103.000= 

EXIT  : BOOLEAN; 

^characteristic*) 

104.000= 

(* IN SECT- insect*) 

105.000=BEGIN 

(*resistance  and*)) 

106.000= 

EXIT  : =FALSE; 

(*MITE-mite*) 

(*resistance*) 

( *inf ormation*) 

107.000= 

WHILE  (NOT  EXIT)  AND 

(NAME A [ LINE ] >=NAMEI [ CT ] ) AND 

(CT<N)  DO 

108.000= 

BEGIN 

109.000= 

IF  (NAMEA[ LINE ] =NAMEI [ CT ] ) ' 

THEN  (*NAMEA-Entry*) 

110.000  = 

BEGIN 

(*f rom  HORT, *) 

111.000= 

EXIT  : = TRUE; 

(*NAME1 -entry*) 

112.000= 

END; 

(*f  rom*) 

1 13.000  = 

(*INTEST,*) 

114.000= 

CT  : = CT+1; 

(*NAMEM-*) 

1 15.000  = 

END; 

(*entry*) 

116.000= 

FOR  J : = 1 to  48  DO 

(*f rom*) 

(*ERTEST*) 

117.000= 

MERGAR [LINE] [ J ] : 

= APPLE [ J 

,LINE ] ; (*MERGAR-*) 

118.000= 

IF  EXIT  THEN 

(*merged*) 

119.000= 

BEGIN 

(*inf orma-*) 

120.000= 

FOR  J :=  49  TO  124  DO 

(*tion*) 

121.000= 

MERGAR [LINE] [J] 

: = INSECT  [ J-38,CT ] ;(*If  *) 

(*found  a*) 

122.000  = 

END 

(*cor responding*) 

123.000= 

ELSE 

(*entry  in*) 

124.000 

BEGIN 

(*insect*) 

^information*) 


^Compiler  restrictions  necessitated  de- 
claring an  array  of  350  entries  with  200 
characters /entry . With  our  data  we  were 
obligated  to  run  the  data  as  5 operations. 


(*merge  it  with*) 
(*hort*) 

( information*) 

125.000=  NOINS  : = TRUE;(*NOINS-no  insect  info  found*) 

126.000=  END; 

127.000= 


128.000= 

129.000= 

130.000= 

131.000= 

132.00= 

133.00= 

134.000= 

135.000= 

136.000= 

137.000= 


EXIT  : = FALSE; 

WHILE  (NOT  EXIT)  AND(NAMEA[  LINE  ] >=NAMEM  [CT2  ] ) 

(CT2<N)  DO 
BEGIN 

IF  (NAMEA[ LINE ] =NAMEM  [ CT2  ] ) THEN  (*If  found*) 
EXIT  : = TRUE;  (*at  corresponding*) 

CT2  : = CT2+1;  (*entry  in  mite*) 

END;  (information*) 

(*merge*) 

IF  EXIT  THEN 
BEGIN 


138.000=  FOR  J : = 125  to  159  DO 

139.000=  MERGAR[LINE] [J]  : = MITE [ J-108 ,CT2 ] ; 

140.000=  END; 

141 . 000=END; 

142.000= 


This  program  MERGE  will  generate  2 types  of  output  files: 

(1)  File:  CMFIL  for  use  as  a database  for  program  PDTRACE, 
along  with  SYNFILE. 

(2)  File:  DMFIL  for  use  as  a reference  document  and  the 
database  for  program  SELECT. 

14.  File  name:  CMFIL:  A master  file  created  by  using  the 
program  SORT  to  sort  and  list  entries  from  the  file  HORT,  and 
then  merging  the  data  from  the  INTEST  and  ERTEST  files  into 
this  sorted  listing  through  the  use  of  the  program  MERGE. 
CMFIL  is  used  as  the  data  base  in  future  programs. 

The  file  is  organized  as  follows: 

Column  Description 


1-10 

germplasm  entry 

11-20 

female  parent 

21-30 

male  parent 

31-46 

horticultural  information 
as  described  for  file 

HORT 

49-101 

insect  information  as 
described  for  file  INTEST 

127-146 

mite  information  as 
described  for  file  ERTEST 
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An  example  of  the  data  in  the  file  CMFIL  follows: 


£ 

CM 

CO 

03 

CBR1T22 

MCINTOSH 

TNR6T13 

082569121243 

06 

781 

01  5NS 

01.  INS 

00.6NS 

01.7NS 

01  ONS 

00.2NS 

07. ONS 

CBR1T29 

MCINTOSH 

TNR6T13 

090269221244 

06 

781 

00.1  NS 

00.7NS 

01. INS 

04  9NS 

CBR2T43 

MCINTOSH 

TNR6T13 

082569123133 

06 

781 

00.9NS 

00.9NS 

00.5NS 

12.9NS 

02.0NS 

00.3S1 

01  ONS 

CBR3T2 

597NJ1 

NJ3559S9 

082569334233 

06 

15.  File  name:  DMFIL  - lists  merged  data  from  files 

HORT,  INTEST  and  ERTEST  in  the  uncoded  and  expanded  form.  The 

following  is  an  example  of  information  contained  in  the  file. 

SELECTION  : CBR1T22 
FEMALE  PARENT  : MCINTOSH 
MALE  PARENT:  TNR6T13 
THE  DATE  OF  TEST  : 08/25/69 
SIZE:  2.25  INCHES 
QUALITY:  GOOD 
SHAPE:  OBLATE 
COLOR:  MED  RED 
FLESH  TYPE:  TENDER 
SEASON:  LATE  AUGUST 
MILDEW  RATING: 

SPUR  TYPE:  NO 

RESISTANCE  SOURCES  : FLORIBUNDA 
THE  YEAR  OF  INSECT  TEST:  1977 
TEST  NUMBER:  1 
CODLING  MOTH  STINGS:  01.5NS 
CODLING  MOTH  EXIT  HOLES  : 01. INS 
PLUM  CURCULIO  FEEDING  SCARS  : 00.6NS 
PLUM  CURCULIO  EXIT  HOLES  : 01.7NS 
APPLE  MAGGOT  STINGS:  01.0NS 
APPLE  MAGGOT  EXIT  HOLES  : 00.2NS 
RED  BANDED  LEAFROLLER  SCARS  : 07.0NS 
RED  BANDED  LARVAE  : 

RED  BANDED  ADULTS  : 

MITE  INFORMATION: 

Note:  BLANKS  DENOTE  NO  INFORMATION  AVAILABLE 


(For  some  entries  test  data  from  screening  for  resistance  to 
insects  and/or  mites  were  unavailable;  for  example,  CBR  3T2. 
To  handle  such  situations,  blank  spaces  were  provided  in 
the  program.) 


When  a data  set  is  unavailable,  for  example  insect  and/or 
mite  information,  the  program  provides  blank  spaces,  for 
CBR1T22  information  on  MILDEW,  test  results  on  REDBANDED 
LARVAE,  and  REDBANDED  ADULTS  are  lacking.  Therefore,  spaces 
reserved  for  such  information  are  left  blank. 


16.  Program:  SELECT  - Uses  file  DMFIL  as  the  database  for 
input.  The  program  can  be  used  to  select  individuals  and 
produce  an  output  with  the  certain  characteristics. 

A.  (I)  with  a specified  parent 

(II)  with  specific  disease  resistance 

(III)  with  a specified  shape 
or  size  or  color  or  quality  or  fleshtype 
or  season  of  harvest  or  mildew  rating 
or  all  of  the  above 

(IV)  with  insect  and/or  mite  resistance  and  a designated 

level  of  significance  (P  < .05  or  .01) 

An  example  of  an  output  generated  by  SELECT  is  shown  below: 

INDIVIDUALS  WITH  SIZE>  2.25  INCHES 
HAR1T11 
HAR1T25 
HAR1T71 
HAR2T46 
HAR2T128 
HAR2T164 
HAR2T169 
HAR2T146 

INDIVIDUALS  WITH  GOOD  QUALITY 
HAR1T11 
HAR1T25 
HAR2T128 
HAR2T159 
HAR2T178 

THE  SEASON  OF  HARVEST  IS  OCTOBER 
HAR6T51 
CBR4T10 
CBR9T28 

The  following  is  an  example  of  the  Pascal  code  used  to  select 
the  germplasm  entry  for  a specified  property  or  trait.  The 
procedure,  to  choose  individuals  with  a desired  property, 
uses  a "string"  type  data.  The  string  describes  the  property, 
such  as  good,  poor,  and  so  forth. 


3.00  inches 
3.00  inches 
3.00  inches 
2.7  5 inches 
2.5  inches 
2.5  inches 
2.7  5 inches 
2.5  inches 
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PROCEDURE  CHQU I AL( STRING  : STYPE); 
VAR 


I , J : INTEGER; 

BEGIN 

WRIT ELN(_INDI VI DUALS  WITH_,  STRING  ,_QUALITY_); 
FOR  I:  = 1 TO  N DO 


BEGIN 

IF  RECARR [ I ] . QUIAL=STRING  THEN 
WRITELN( RECARR  [ I ] .APPLE) ; 

END; 


WRITELN ; 

END;  ( *PROCEDURE*) 


(*Recarr-holds*) 
(*al  1 of  the*) 
(*character-*) 
(*istics*) 

(*of  the*) 
(*gerraplasm*) 

( *exaraple*) 
(*RECARR[I] .*) 
(*Quial=quality*) 


B.  (I)  From  a list  of  disease  resistance  germplasra  entries 
this  program  will  also  generate: 


1.  A list  of  all  parents  used  in  these  germplasm  entries; 

2.  The  parents  most  frequently  occurring  parents  in  the 
group  of  entries; 

3.  Calculate  the  percentage  of  times  they  (parents)  occur. 

(II)  Repeats  steps  (I)  1.,  2.,  and  3.  for  entries  with 
pest  resistance , for  example  insect  or  mite  resistance. 


The  Pascal  code  for  listing  the  parents  in  a group  of 
germplasra  entries  and  calculating  the  percentage  frequency  of 
their  occurrence  is  described  below: 


PROCEDURE  PERCENT(NAME: STYPE); 

VAR 

I , J ,K  : INTEGER; 

PER  : REAL; 

BEGIN 

J : =0 ; 

FOR  I : = 1 DO  200  00 

IF  COUNTLIST [ I ] <>0  then  (*countlist-holds  count  of*) 

(*how  frequent  each*) 

J : =J+C0UNTLIST [ I ] ; (*parent  occurred*) 

I:=l; 

WHILE  (I<200)  and  (PARLIST[I  ] ONAME)  DO  (*PARLIST-list*) 

(*of  parents*) 

BEGIN 

I:  =1+1; 

IF  PARLIST [ I ] =NAME  THEN  (*countlist-how*) 

(*many  times*) 
(*each  parent*) 


K : =COUNTLIST [ I ] ; 


( *occurred*) 


END; 

IF  JOO  THEN 

BEGIN  (*per-variable  used*) 

(*to  calculate  the*) 

PER  :=(K/J)*100;  (*percentage*) 

WRITELN(_THE  PARENT_,NAME, : OCCURS_,PER:5:3,_PERCENT  OF 
THE  LINE_) ; 

END 

ELSE 

WRITELN(_  THE  DATA  ON  PARENTS  NAMES  IS  NOT  AVAILABLE_); 

END; 

The  sample  of  the  output: 

INDIVIDUALS  WITH  ZUMI  CALOCARPA  RESISTANCE 


CCR3T18 

CCR3T27 

CCR3T35 

CCR3T44 

CCR3T47 

CCR3T54 

CDR3T3 

CDR3T17 

CDR5T8 

CDR5T9 

FEMALE  PARENT  : MCINTOSH 


CCR3T18 

MCINTOSH 

TSR4T237 

CCR3T27 

MCINTOSH 

TSR4T237 

CCR3T35 

MCINTOSH 

TSR4T237 

CCR3T44 

MCINTOSH 

TSR4T237 

CCR3T47 

MCINTOSH 

TSR4T237 

CCR3T54 

MCINTOSH 

TSR4T237 

MALE  PARENT  : 

MCINTOSH 

None 

FEMALE  PARENT 

: TNR2T52 

None 

MALE  PARENT  : 

TNR2T52 

CDR3T3 

TSR5T27 

TNR2T52 

FEMALE  PARENT 

: TSR5T27 
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CDR3T3 

TSR5T27 

TNR2T52 

MALE  PARENT  : 

TSR5T27 

None 

FEMALE  PARENT 

: TSR4T237 

CDR5TI7 

TSR4T237 

NJ28 

CDR5T8 

TSR4T237 

NJ28 

CDR5T9 

TSR4T237 

NJ28 

MALE  PARENT  : 

TSR4T237 

CCR3T18 

MCINTOSH 

TSR4T237 

CCR3T27 

MCINTOSH 

TSR4T237 

CCR3T35 

MCINTOSH 

TSR4T237 

CCR3T44 

MCINTOSH 

TSR4T237 

CCR3T47 

MCINTOSH 

TSR4T237 

CCR3T34 

MCINTOSH 

TSR4T237 

FEMALE  PARENT 

: NJ28 

None 

MALE  PARENT  : 

NJ28 

CDR5T17 

TSR4T237 

NJ28 

CDR5T8 

TSR4T237 

NJ28 

CDR5T9 

TSR4T237 

NJ28 

THE  MOST  COMMON  PARENT  IS  TSR4T237 
THE  NEXT  MOST  COMMON  PARENT  IS  MCINTOSH 


17.  Program:  PDTRACE  - The  information  from  file  CMFIL  is 
used  to  trace  the  required  pedigrees.  The  procedure 
"PEDIGREE"  (shown  below)  uses  a binary  tree  to  develop  the 
complete  pedigree  of  any  particular  germplasm  entry.  Then, 
the  file  SYNFILE  is  searched  linearly  to  get  a listing  of 
other  germplasm  entries  with  the  identical  pedigree. 


139.000=  PROCEDURE  PEDIGREE(VAR  HDSDPTR 

SDPTR); 


140.000= 

141.000= 

142.000= 

144.000= 

VAR 

145.000- 

NODE, ONE  : SDP1 

146.000= 

MOTHER, FATHER  : 

147.000= 

148.000= 

BEGIN 

149.000= 

IF  HDSDPTR  < 

150.000= 

THEN 

151.000= 

BEGIN 

NAMES; 


(*this  procedure*) 
(*will  use  a*) 
(*linked  list  to*) 
(*find  the*) 
(*complete*) 
(*pedigree  of  a*) 
(*particular*) 
(*seedling*) 


(*of  pointers  to  germplasm*) 
(*entr ies*) 


152.000= 

153.000= 

154.000= 

155.000= 

156.000= 

157.000= 

159.000= 

160.000= 

161.000= 

162.000= 

163.000= 

164.000= 

165.000= 

166.000= 

167.000= 

168.000= 

169.000= 

170.000= 

171.000= 

172.000  = 

173.000  = 


FIND(HDSDPTR". PEDSEED, MOTHER, FATHER) ; 
(*BINARY  SEARCH*) 


NEW(NODE);(*initiates  pointer  to  the  new*) 
(*gerraplasra  entry*) 

NODE".  FEMALEPTR  : = NIL;(*pointer  to  the*) 

(*female  parent*) 

= NIL;  (*pointer  to  the*) 
(*male  parent*) 

= MOTHER;  (*append  new*) 

(*entry  to  the  end*) 
(*of  the  list*) 


NODE".  MALEPTR 


NODE".  PEDSEED 


( *SET  MALE*) 
NEW(ONE) ; 

ONE  ".FEMALEPTR  : 
ONE". MALEPTR  : = 
ONE". PEDSEED  : = 

HDSDPTR". MALEPTR 


(*do  the  same  for  the*) 
(*male  parent*) 

= NIL; 

NIL; 

FATHER; 

: = ONE; 


IF  (MOTHER  < > BLANKS)  THEN 

BEGIN 

PEDIGREE(NODE) ; 

END; 

IF  FATHER  < > BLANKS  THEN 

PEDIGREE  (ONE); 


END; 

END; 


A companion  procedure  TRAVERSE  is  used  to  find  the  names  of 
the  ancestors  of  each  germplasra  entry  and  print  them  in  the 
appropriate  position  of  a "family  tree"  as  dictated  by  the 
procedure  PAGEOUT. 


254.000=PROCEDURE  TRAVERSE  (TREE  : SDPTR;  LEVEL  : INTEGER); 
255.000= 

256.000= 

257.000= 

258.000= 


259.000= 
260.000=  BEGIN 
261.000=  IF 

262.000= 

TREE  < > NIL 

263.000=  BEGIN 

264.000= 

TRAVERSE(TREE". FEMALEPTR,  LEVEL+1); 

265.000  = 

IF  (TREE".  PEDSEED  < > UNKNOWN 

THEN 

266.000= 

IF  (TREE".  PEDSEED  < > CHANCE 

THEN 

267.000= 

IF  (TREE".  PEDSEED<  > 

_) AND(TREE" . PEDSEED 
< > ANTMOM  ) THEN 


_) 

SDLNG  ) 
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268.000= 


269.000= 

270.000= 

271.000= 

272.000=END; 


WRITELN  (PROGEN , TREE ".PEDSEED, 
TREE". FEMALE  PTR" . PEDSEED, 
TREE". MALE  PTR".PEDSEED) 
PAGEOUT  (TREE". PEDSEED, LEVEL); 

TRAVERSE  (TREE".MALEPTR,  LEVEL+1); 

END 


Figure  2 shows  a typical  pedigree  trace  generated  by  this 
program. 

18.  File:  PROGEN:  The  program  PDTRACE  will  also  generate  the 
file  PROGEN  in  which  each  gerraplasm  entry  with  N ancestors  (N 
being  a constant)  is  allocated  the  required  number  of  lines. 
Pedigree  traces  are  read  from  right  to  left,  that  is  the 
earliest  generation  to  the  most  recent.  The  name  of  a single 
ancestor  and  its  known  parents  are  recorded  on  one  line;  for 
example,  four  ancestors  of  the  cultivar  'ALEXANDER'  (including 
itself)  will  be  represented  in  the  following  manner: 

MBAC 

MROBUSTA  MBAC  MPRUN 

MPRUN 

ALEX  MROBUSTA 


File  PROGEN  serves  as  the  data-base  for  the  programs  INBREED 
and  SEARCH. 

19.  Program:  INBREED  - Calculates  the  covariance  values  that 
are  statistical  measures  of  interrelationships  between 
individuals  in  a pedigree  (the  covariance  value  may  be  used 
to  calculate  the  inbreeding  coefficient  (F)  by  the  following 
formula:  F = C0V(I,J)-1.  The  value  of  covariance  can  range 
from  0 (not  related)  to  2 (selfed  or  fully  inbred).  The 
covariance  is  equal  to  twice  the  coefficient  of  coancestry 
(Falconer  1981)  for  those  who  may  use  the  probability  approach. 
The  general  procedure  used  for  calculating  the  covariance 
between  two  individuals,  is  shown  below: 


197.000=FOR  I : = 1 TO  N DO 
198.000=  FOR  J : = 1 TO  N DO 
199.000=  BEGIN 

200.000=  IF  COVAR  [I,J  ] <>  0 THEN 

201.000=  BEGIN 

202.000=  K : = ICODE  [ 1 , J ] ; 

203.000=  L : = ICODE  [2,J] 

204.000=  IF  (KOO)  THEN 

205.000=  BEGIN 

206.000=  COVAR [ I, K]  : = COVAR [ I , J ] /2; 


GREAT 

GRAND  GRAND  - 

PARENTS  PARENTS  PARENTS 


GREAT  - 
GREAT 
GRAND  ' 
PARENTS 


GREAT  - 
GREAT 
GREAT  - 
GRAND 
PARENTS 


MELBA 


O P.  SEEDLING 
OF 

MCINTOSH 


67634  


SONORA  UNKNOWN 


110037 


O P.  SEEDLING 

MELBA OF 

MCINTOSH 


NJ53 


C 


WILLIAMS 
STARR  


SPORT 
OF  ROME 

UNKNOWN 


2175-7 


MELBA 


OP  SEEDLING 
OF 

MCINTOSH 


117637 


NJ130 


WEALTHY 


STARR 


1018101 


47-147 


JONATHAN 


■F2  26829-2-2 


R.  ROME SPORT 

OF  ROME 


MELBA O P.  SEEDLING 

OF  MCINTOSH 


9433  2-2 


9433  2-2 


ROME  BEAUTY 


M F L O R I B U N D A 821 


Figure  2. 

Diagram  of  the  pedigree  of  selection  2175-7 
= HCR2lT200=:Coop  13  ' Redfree'  traced  back 
5 generations.  The  seed  parent  is  listed 
first  in  each  cross. 
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207.000  = 
208.000= 
209.000= 

210.000= 

211.000= 

212.000= 

213.000= 

214.000= 

215.000= 

216.000= 

217.000= 

218.000= 

219.000= 

220.000= 

221.000= 


INBRED  : =FALSE; 

ANSW  : =SEARCH  (I, INBRED); 

IF  INBRED  THEN  (*see  if  individual*) 
(*is  inbred*) 

CO VAR [ I , ANSW]  : = COVAR [ I , ANSW]+1 ; 
END  ; (*If  individual  is  inbred, use*) 
(*forraula  above*) 

IF  (LOO)  THEN 
BEGIN 

COVAR[I,L]  : = COVAR [ I , J ] /2; 

INBRED  : = FALSE; 

ANSW  : =SEARCH  (I, INBRED); 

IF  INBRED  THEN 

COVAR [I, ANSW]  : = 

COVAR [I, ANSW] +1; 

END; 

END; 

END; 


I,  J,  and  K are  the  numeric  representations  of  the  alpha- 
numeric or  numeric  names  of  individuals.  I and  K represent 
the  individuals  for  which  the  covariance  is  to  be  calculated. 
COVAR  [I,J]  is  the  covariance  value  between  individuals  I and 
J when  individual  K is  related  to  the  individual  J.  When  we 
consider  an  inbred  individual,  the  formula  changes  into  COVAR 
[1,1] =F  +1  where  F =[ 1/ 2(C0V(I , Female  Parent  )+cov(I,Male 

Parent  ))]  " ^l+FFemale  parent-^  ^1+FMale  parent-^* 

The  program  INBREED  outputs  a two-way  listing  of  individuals 
in  the  pedigree  along  with  the  respective  values  above  0. 

The  covariance  (I,  J)  = 2F.  The  alpha-numeric  names  of 
individuals  are  listed  both  on  the  first  line  (row)  and  in 
the  first  column  of  the  table.  The  covariance  values  are 
listed  within  the  table.  If  there  are  N individuals  in  a 
pedigree,  the  program  will  generate  X tables  (X=N/Z  where  N 
is  a variable  number  of  individuals,  and  Z is  a variable 
number  of  columns  per  table,  and  X is  rounded  to  the  next 
integer.  The  dimensions  of  the  table  thus  generated  are 
dependent  on  the  size  of  the  computer  screen  and/or  the  size 
of  the  paper  available  for  printing  out.  In  our  program  we 
used  Z=5  which  produced  an  output  as  shown  following: 


Gerraplasra 

Entry  PR14T102  PR7T19  PR7T41  PWR11TL46  RMELBA 


OR48T70 

0.50000 

OR48T84 

0.50000 

OR52T138 

OR53T63 

PWR14T102 

1.00000 

0.50000 

0.50000 

PR7T19 

PR7T41 

1.00000 

1.00000 

Program  INBREED  also  generates  the  list  of  all  individuals 
and  their  parents  included  in  the  listing  as  shown  below: 


i — 1 

co 

r— 1 

CM 

Germplasm  entry 

FEMALE 

MALE 

OR48T70 

PR14T104 

DEL 

0R48T84 

JONRED 

PR14T104 

0R52T138 

JON 

PR7T41 

0R53T52 

STARKGL 

14-126 

0R53T63 

STARKGL 

PR7T19 

PI172616 

PR14T102 

76-7 

CRAN 

PR7T19 

GDEL 

F2  26829-2-2 

PR7T41 

GDEL 

F2  26829-2-2 

The  presence  of  a blank  entry  in  the  female  and/or  male 
parent  columns  of  the  listing  indicates  an  unknown  parent. 

20.  File:  COV  - the  tables  of  covariance  coefficients  and 
lists  of  individuals  described  above  are  stored  in  this  file. 

21.  Program:  SEARCH  - Uses  file  PROGEN  as  a data 

base  to  locate  the  parents  of  those  entries  which  have  common 
recurring  parents.  A pedigree  is  traced  for  a maximum  of 
five  generations  and  such  pedigrees  are  then  searched 
linearly  for  recurring  parents  to  indicate  inbreeding.  The 
coding  used  is  as  described  as  follows: 
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FOR  J:=  1 TO  I DO 
BEGIN 

B I : =FALSE; 

B2  : =FALSE ; 

FOR  K : =J+ 1 TO  I DO 
BEGIN 

IF  (PARENT [J]<>_  _) AND  ( PARENT [J ] <>_CHANCE 

SDLNG  )THEN 


BEGIN 

B1 

: =ALF ACHEC  K(P ARENT [ J ] ); 

(*see  if  the*) 
(individual  is*) 

(*a  cultivar*) 

B2 

:=ALFACHECK( PARENT [K] ) ; 

IF 

B1  AND  B2  THEN 

IF  ( PARENT [ J ] =PARENT [ K ] ) 

THEN  (*see  if  the*) 

( *parent*) 
(*occurred  more 

FOUND  : =TRUE ; 

(*than  once*) 

END; 


END; 

END; 

IF  FOUND  THEN 

WRITELN(_  _,NAME): 

END 

The  output  generated  by  this  program  is  a list  of  progeny 
numbers  and/or  selections  chosen  as  shown  below. 

646-2 

688-100 

9AR2T144 

9AR4T47 

The  programs  described  in  this  publication  can  be  modified 
for  use  with  Apple  He  or  any  other  personal  computer  (PC) 
that  can  be  programmed  in  Pascal  and  by  making  needed  changes 
for  program  comparability.  The  reference  guide  for  Apple  lie 
Pascal  describes  in  detail  the  characteristics  of  the  unique 
syntax  features  to  be  used  for  the  specific  compiler.  If  used 
with  a PC,  the  entire  programs  cited  here  may  be  changed  to 
conform  to  the  level  of  interaction  needed  between  the 
program  and  its  user. 
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